Yuhao Li

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Jan 29, 2026

Emergent Specialization in Learner Populations: Competition as the Source of Diversity

Jan 16, 2026

Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark

Sep 09, 2025

Kernel Two-Sample Testing via Directional Components Analysis

Aug 12, 2025

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Jun 16, 2025

A Culturally-diverse Multilingual Multimodal Video Benchmark & Model

Jun 08, 2025

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

May 30, 2025

Dynamical Label Augmentation and Calibration for Noisy Electronic Health Records

May 12, 2025

DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding

Mar 13, 2025

C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation

Feb 27, 2025